Abstract. We consider the problem of constructing a communication infrastructure from scratch, 
for a collection of identical wireless nodes. Combinatorially, this means a) finding a set of links that 
form a strongly connected spanning graph on a set of n points in the plane, and b) scheduling it 
efficiently in the SINR model of interference. The nodes must converge on a solution in a distributed 
manner, having no means of communication beyond the sole wireless channel. 

We give distributed connectivity algorithms that run in time O(poly(log A, log n)), where A is 
the ratio between the longest and shortest distances among nodes. Given that algorithms without 
prior knowledge of the instance are essentially limited to using uniform power, this is close to 
best possible. Our primary aim, however, is to find efficient structures, measured in the number 
of slots used in the final schedule of the links. Our main result is algorithms that match the 
efficiency of centralized solutions. Specifically, the networks can be scheduled in O(log n) slots using 
(arbitrary) power control, and in O(log n (log log A + log n)) slots using a simple oblivious power 
scheme. Additionally, the networks have the desirable property that the latency of a converge-cast 
and of any node-to-node communication is an optimal O(log n) time. 



1. Introduction 

We consider the problem of constructing a communication infrastructure from scratch, for a 
collection of identical wireless nodes. Combinatorially, this means finding a set of links that form 
a strongly connected spanning graph on a set of points in the plane, and scheduling it efficiently in 
the SINR model of interference. The nodes must converge on a solution in a distributed manner, 
having no means of communication beyond the sole wireless channel. The issue is how quickly and 
how well: the time it takes to form the structure and the efficiency of the final schedule produced. 

The importance of creating a connected structure spanning a set of wireless nodes can hardly be 
overstated. This may underlie a "multi-hop" wireless network, where any two nodes can commu- 
nicate through path(s) specified by such a structure. In an ad-hoc network, such a structure may 
provide the underlying backbone for synchronized operation of the network. In a wireless sensor 
network, the structure can double as an information aggregation mechanism. 

The efficiency of a structure is closely intertwined with the issue of interference, the distinguishing 
feature of wireless communication. Interference implies that only a limited number of transmissions 
can be successful simultaneously; this number depends on the spatial distribution of the links, power 
settings, etc. We adopt the SINR (or physical) model of interference, which has been shown both 
theoretically and experimentally to be a more faithful representation of reality than many of the 
traditional graph-based models [181 E2]. 

Achieving an efficient schedule involves deciding power levels for the links, which may either be 
fully instance-dependent ("arbitrary"), or be chosen in an "oblivious" manner, depending only on 
the length of each link. Recent centralized results show that it is possible to connect any link set 
using O(log n) slots [11], whereas the use of oblivious power is bound to involve a factor of log log A 
[E1I11III], where A is the ratio between the longest and shortest distances in the network. 

Achieving connectivity is a distributed problem par excellence. Distributed algorithms often 
assume "free" local communication. In contrast, since the purpose in this paper is to build a 
communication infrastructure from scratch, we assume that the only mode of communication allowed 
is transmission in the single wireless channel, which succeeds if the required 
signal-to-interference-and-noise ratio is achieved. We also do not assume a carrier sensing primitive 
(see, e.g., [26]) that allows nodes to estimate the amount of activity on the channel. 

Given that the nodes have no information about distances to nearby nodes, they are in effect 
limited to using a pre-defined fixed power initially. It is known that such a simple power 
scheme can require a linear number of slots to connect the nodes in the worst case [21]. A more refined 
bound is log A, where A is the ratio of the maximum to the minimum distance among the nodes. 
We provide a distributed algorithm that forms an (initial) connected network in time O(log A · log n), 
which is probably close to the best possible. 

The quality or efficiency of the final structure is another story. Once the initial (and possibly 
inefficient) network is formed, we are interested in retooling the network, still in a distributed 
fashion, but using the existing network, to find improved connectivity structures. We provide two 
approaches to this. First, we show that the initial network has nice geometric properties that 
allow us to use (distributed) power control to make it much more efficient. Second, we propose a 
more sophisticated approach: instead of simply changing the power settings of the links of the initial 
network, we leverage the initial tree to construct a new set of links (and their power settings) that can 
be scheduled even more efficiently, while still achieving connectivity. This suggests a novel interplay 
between different layers: a network layer (i.e., the initial tree) that goes back and retools both 
itself (choosing new links) and the MAC layer (changing power settings and schedules). 

The challenge raised in this paper can then be stated as follows: 

Is there a distributed algorithm, running in time O(poly(log A, log n)), that results 
in a nearly optimal strongly connected structure in the SINR model? 

We answer this question affirmatively, giving algorithms that match the best upper bounds 
known for centralized algorithms. This holds both for oblivious power assignments as well as when 
allowing arbitrary power assignments. In particular, using arbitrary power, we find and schedule 
a bidirectional tree in O(log n) slots that has the property that both aggregation computation and 
any pairwise communication can be achieved in optimal logarithmic time. 

The rest of the paper is organized as follows. We introduce the model and key definitions 
in Sec. 3 and discuss related results in Sec. 2. Our results are described in Sec. 4. Section 5 
contains technical definitions and clarifications that are essential for the analysis but not needed to 
understand the results. The algorithm for the initial network construction is given and analyzed 
in Sec. 6. Our two approaches to finding extremely efficient schedules are presented in Sec. 7 and 
Sec. 8, respectively. Several proofs and construction details have been deferred to appendices. 

2. Related Work 

Connectivity was the first problem studied from a worst-case perspective in the SINR model. 
In a seminal paper, Moscibroda and Wattenhofer [21] formalized the problem and proposed an 
algorithm that connects an arbitrary set of n points in O(log^4 n) slots. This was improved to O(log^3 n) 
[23], O(log^2 n) [20], and recently to O(log n) [11]. All these works deploy centralized algorithms. 
No non-trivial lower bound is known. Somewhat orthogonally, a large body of work exists on 
randomly deployed wireless networks, starting with the influential work by Gupta and Kumar [7]. 
Work in this setting for connectivity includes [1], which studied the probability of there existing a 
path between two nodes in a randomly deployed network. In [25], minimum energy connectivity 
structures are studied for randomly deployed networks, but interference is essentially ignored. 

Distributed connectivity of wireless networks has also been the subject of research. In [28], 
connectivity in mobile networks was studied from a graph-theoretic perspective with no explicit 
interference model. Indeed, the connectivity maintenance problem has been well studied in control 
theory and robotics [281 021 E], but with different underlying assumptions, typically without the 
use of the SINR interference model. Sensor connectivity has also been studied [13] without reference 
to any particular interference model. In [23], a heuristic was proposed for connectivity maintenance 
in multi-hop wireless networks. A more rigorous study was done in [27] but with the assumption 
of an underlying MAC layer that resolves interference problems. 

Two fundamental problems that deal with a given set of links relate to this work. Capacity: find 
the largest feasible subset of links, and Scheduling: partition the link set into the fewest number of 
feasible sets. For the former, constant-factor algorithms were given for uniform power 0IT2], mean 
and linear power (and most other oblivious power assignments) [10], and power control [14]. These 
imply a logarithmic factor for the corresponding scheduling problems. A distributed algorithm was 
given for Scheduling with oblivious power [15] and shown to achieve an O(log n)-approximation [9]. 

Distributed algorithms have also been given for local broadcasting [6] and dominating set [26] in 
the SINR model. Both of these problems are, however, local in nature. 

The Minimum-Latency Aggregation Scheduling problem is closely related to connectivity, where 
the latency for transmitting messages to a sink is to be minimized. A large literature is known, 
but the first worst-case analysis in the SINR model was given in [16], with an O(log^3 n) bound 
on the schedule length by a centralized algorithm and O(log A) by a distributed algorithm. The 
centralized bound was improved to the optimal O(log n) in [11]. 

3. Model and Preliminaries 

Given is a set P of n wireless nodes located at points on the plane. Without loss of generality 
assume that the minimum distance between any two points is 1. The nodes have synchronized 
clocks, and start running the distributed algorithm simultaneously using slotted time. Each node 
knows its location and has a globally unique ID. A single message is large enough to contain the 
ID and the location of a node. A receiver of a message thus always knows its distance from the 
sender and can identify the sender uniquely. 

A link is a directed edge between two nodes, indicating a transmission from the first node (the 
sender) to the second (the receiver). A link between u and v is denoted by (u, v); ℓ will also be 
used to indicate a generic link. A link set L naturally induces a set of senders S(L) and a set of 
receivers R(L). The link (y, x) is known as the dual of link (x, y), following [15]. A link set X is 
a dual of set Y if X consists of the duals of the links in Y. The degree of a node u in a link set L 
is the number of links incident on u in L. The distance between two nodes u and v is denoted by 
d(u, v) (this is also the length of the link (u, v)). Let A denote the maximum length of a possible 
link. A length class refers to a set of links whose lengths differ by a factor of at most 2. 

In the SINR model of interference, a non-transmitting node v successfully receives a message 
transmitted by node u if 

$$ \frac{P_u / d(u,v)^{\alpha}}{N + \sum_{w \in S \setminus \{u\}} P_w / d(w,v)^{\alpha}} \ge \beta, \tag{1} $$

where N is the ambient noise, β is the required SINR level, α > 2 is the so-called path loss constant, 
P_w is the power used by node w, and S is the set of senders transmitting simultaneously. A set 
L of links is feasible if the above constraint holds for all v ∈ R(L) where S = S(L). We do not 
impose any limit on the power a node can use. 
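
As an illustration of the feasibility condition, the following minimal Python sketch checks the SINR constraint for every link of a set. The function name `sinr_feasible`, the default parameter values, and the representation of nodes as coordinate pairs are assumptions made for this example only; it also assumes that no receiver coincides with a sender.

```python
import math

def sinr_feasible(links, power, alpha=3.0, beta=1.0, noise=1e-9):
    """Return True if every receiver in `links` meets the SINR threshold.

    links: list of (sender, receiver) pairs, each node an (x, y) point.
    power: power[i] is the transmit power of the sender of links[i].
    Assumes no receiver is located at a sender's position.
    """
    for i, (u, v) in enumerate(links):
        signal = power[i] / math.dist(u, v) ** alpha
        # Interference at v from all other senders transmitting in the slot.
        interference = sum(
            power[j] / math.dist(w, v) ** alpha
            for j, (w, _) in enumerate(links)
            if j != i
        )
        if signal < beta * (noise + interference):
            return False
    return True
```

Two unit-length links placed far apart pass the test under uniform power, while two unit-length links whose senders sit at distance 1 from each other's receivers do not.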

The goal is to identify a set T of links that both strongly connects the wireless nodes and can 
be scheduled efficiently (i.e., can be partitioned into few feasible sets). Additionally, we seek low 
latency constructions. 

A converge-cast tree is a directed rooted spanning tree where all links are oriented towards the 
root (i.e., for each link, the receiver is a parent of the sender). An aggregation tree is a 
converge-cast tree along with a schedule of the links that has the property that each link (x, y) in the tree 
is scheduled after all links involving descendants of x. A dissemination tree is the opposite: a 
broadcast tree (spanning arborescence) with links oriented away from the root, with the opposite 
property for the schedule. In both cases, the scheduling order follows link directions and paths in 
the trees. 

Definition 1. A bi-tree is an aggregation tree with a complementary dissemination tree, using the 
same links in the opposite direction and same schedule in opposite order. 

Note that with a bi-tree, any node-to-node communication can be achieved within time equal to 
the length of the schedule. The same holds for computing an aggregation or a broadcast. 
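
The ordering property of aggregation and dissemination trees can be checked mechanically. The sketch below is only an illustration: the dictionary-based tree representation and the function names are assumptions of this example. It uses the fact that it suffices to compare each link with the link leaving its receiver, since the property for all descendants then follows by induction along tree paths.

```python
def is_aggregation_schedule(parent, slot):
    """Check the leaf-to-root ordering of an aggregation tree.

    parent: dict mapping each non-root node to its parent.
    slot:   dict mapping each non-root node x to the slot of link (x, parent[x]).
    """
    for child, par in parent.items():
        # The link leaving `par` must come strictly after the link entering it.
        if par in parent and slot[child] >= slot[par]:
            return False
    return True

def dissemination_slots(slot):
    """Reverse the slot order; entry x is the slot of link (parent[x], x)."""
    last = max(slot.values())
    return {child: last + 1 - s for child, s in slot.items()}
```

Reversing the slots of a valid aggregation schedule gives the root-to-leaf order required of the complementary dissemination tree in a bi-tree.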

The following power assignments are of interest. An oblivious power assignment is one where 
the power assigned to a sender u is a (simple) function of d(u, v), where v is the intended receiver. 
The oblivious assignment we are most interested in is the "mean power" assignment ℳ, where 
ℳ_u = d(u, v)^{α/2}. We also use uniform power 𝒰, which assigns the same power to all transmitting 
nodes, and the "linear power" assignment ℒ, where ℒ_u = d(u, v)^α. Note that a sender can transmit 
to different receivers at different times, and may use different powers. Finally, we also consider 
solutions achievable with arbitrary power assignments, where the algorithm is free to use any 
assignment. We let Υ = O(log log A + log n) denote the best ratio known for the cost of using 
oblivious power; namely, it is known that for any set of links, the ratio between the maximum size 
of a feasible subset using arbitrary power vs. using mean power is at most Υ [8, 10]. 
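
The three oblivious assignments can be written as functions of the link length alone. The following sketch is illustrative; the function names and the `base` parameter of the uniform assignment are assumptions of this example.

```python
def uniform_power(length, alpha, base=1.0):
    """Uniform power: the same value for every link, regardless of length."""
    return base

def mean_power(length, alpha):
    """Mean power: length ** (alpha / 2)."""
    return length ** (alpha / 2.0)

def linear_power(length, alpha):
    """Linear power: length ** alpha."""
    return length ** alpha

def received_signal(power, length, alpha):
    """Signal strength at the receiver of a link of the given length."""
    return power / length ** alpha
```

Under linear power every receiver sees the same signal strength, under uniform power the received signal decays as length^{-α}, and mean power sits between the two with decay length^{-α/2}.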

4. Our Results 

We give the first distributed algorithms with performance guarantees for connectivity problems 
in the SINR model. We first provide a basic algorithm: 

Theorem 2. There exists a distributed algorithm that computes a bi-tree T in O(log A · log n) slots. 

We can improve this solution by using scheduling with non-uniform (but oblivious) power 
assignments. Recall that Υ = O(log log A + log n) denotes the cost ratio of using oblivious power. 

Theorem 3. The bi-tree T can be re-scheduled in O(Υ · log^3 n) slots using mean power. 

We then intersperse the connectivity-building and the scheduling to get solutions matching the 
best centralized solutions known. 

Theorem 4. There exists a distributed algorithm (building on the first one) that finds and schedules 
a bi-tree in O(log n) slots (with arbitrary power), using time O(Υ · log A · log n). A variation finds 
and schedules a bi-tree in O(Υ · log n) slots with mean power, using time O(Υ · log A · log^2 n). 

In particular, the bi-tree property ensures that aggregation, broadcast, and pairwise 
communication can all be achieved in an optimal O(log n) steps. 

Technically, this work combines ingredients from numerous recent works on the SINR model 
[HJ [TOl [T4"l [151 13 HI]. In addition, we derive a number of properties, most of which deal with 
the concept of affectance in relation to connectivity structures; intuitively, affectance measures the 
interference of one transmission on the reception of another transmission, relative to the signal 
strength of the latter. We explicitly define a previously considered geometric property of sparsity, 
and show it to imply small average affectance. We give novel algorithms for finding large feasible 
subsets in such sparse link sets. And, we introduce randomized transmission strategies to estimate 
affectance in terms of transmission successes. 

5. Technical Notes 



Our algorithms require the following knowledge about the instance: The number of nodes, n, 
up to a polynomial factor; the minimum distance (assumed to be 1); and the maximum distance 
A. We do not treat A as a constant, although it is small in many systems. Knowledge of A is 
mainly needed for stopping criteria; it can be avoided by computing the size of the tree, if precise 
knowledge of n is available. 

In describing our algorithm, we refer to some messages as broadcasts and some as 
acknowledgments. In terms of if and how these messages succeed, they are identical and work as dictated by 
Eqn. 1. The difference lies in when these messages are transmitted and what they contain. A 
broadcast refers to an exploratory message sent to no node in particular, only containing the sender's 
cast refers to an exploratory message sent to no node in particular, only containing the sender's 
ID and location. An acknowledgment is transmitted as a response to a previous message (typically 
a broadcast) and contains IDs of both the sender (the acknowledger) and the initial broadcaster. 
Thus, receivers receiving an acknowledgment can determine if it was addressed to them or not. 
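
The two message types can be modelled as small records. This is a sketch under assumptions: the class and field names below are hypothetical and chosen for illustration; only the message contents (sender ID and location for a broadcast, the two IDs for an acknowledgment) come from the description above.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class Broadcast:
    """Exploratory message addressed to no node in particular."""
    sender_id: int
    location: Tuple[float, float]

@dataclass(frozen=True)
class Acknowledgment:
    """Response to an earlier message, naming the node it answers."""
    sender_id: int        # the acknowledger
    broadcaster_id: int   # the node whose broadcast is being acknowledged

def is_addressed_to(message, node_id):
    """A receiver accepts only acknowledgments that name it as broadcaster."""
    return isinstance(message, Acknowledgment) and message.broadcaster_id == node_id
```

A node that overhears an acknowledgment naming a different broadcaster simply ignores it.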

All our results are proved to be true "with high probability" (w.h.p., for short), where the term 
means that the relevant event occurs with probability 1 − 1/n^c, for some suitably large constant c > 0. We 
frequently prove a lemma to hold, w.h.p., for a node u, or a link (u, v). It will always be clear 
that such a result can be safely union bounded over all nodes, or all possible links, to derive a 
high probability result for the whole algorithm. The only case that needs care is when we union 
bound over slots in the algorithm. The number of slots in our first algorithm is a function of log A, 
which can be arbitrarily larger than n. Union bounding is still safe for the following reason. The 
algorithm proceeds by considering links belonging to the same length class, and there can be at 
most log A such classes (thus the dependence on log A). However, since there are at most n^2 
links in the network, only n^2 classes can actually be non-empty (in the full version, we provide a 
more refined O(n) upper bound). During empty length classes, nothing happens with probability 
1 and thus the union bounding incurs no "cost". 

Affectance. We use the notion of affectance, introduced in [5, 12] and refined in [15] to the 
thresholded form used here. The affectance a^P_w(ℓ) on link ℓ = (u, v) from a sender w, with a given 
power assignment P, is the interference of w at the receiver v relative to the power received, or 

$$ a^{P}_{w}(\ell) = \min\left\{ 1 + \epsilon,\; c(u,v)\, \frac{P_w / d(w,v)^{\alpha}}{P_u / d(u,v)^{\alpha}} \right\}, $$

where ε is some arbitrary fixed constant (say 0.1), and c(u, v) = β / (1 − βN d(u, v)^α / P_u) depends only 
on the parameters of the link ℓ. We drop P when clear from context. For a set S of senders and a 
link ℓ, a_S(ℓ) = Σ_{w ∈ S} a_w(ℓ). 

Using such notation, Eqn. 1 can be rewritten as a_S(ℓ) ≤ 1, which we adopt. When dealing with 
links ℓ = (u, v) and ℓ' = (u', v'), we use a_ℓ(ℓ') to mean a_u(ℓ'). Extending this to a link set L, we 
use the notation a_L(ℓ) to mean a_S(ℓ) where S = S(L) are the senders in L. For two sets X and 
Y, a_X(Y) thus means Σ_{ℓ ∈ Y} a_{S(X)}(ℓ). From its definition, it is clear that c(u, v) ≥ β. We require 
that c(u, v) ≤ 2β, and point out how to achieve this during the description of the algorithms. This 
simply means that nodes always transmit with power high enough for the intended (or potentially 
intended, in case of a broadcast) links to comfortably succeed in the presence of noise (but no other 
interference). 
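
The thresholded affectance and the rewritten success condition a_S(ℓ) ≤ 1 can be sketched as follows. The function names and default parameter values are assumptions of this example; the code also assumes P_u > βN d(u, v)^α, so that c(u, v) is positive and finite.

```python
import math

def affectance(w, link, p_w, p_u, alpha=3.0, beta=1.0, noise=1e-9, eps=0.1):
    """Thresholded affectance of sender w (power p_w) on link = (u, v).

    Assumes p_u > beta * noise * d(u, v) ** alpha.
    """
    u, v = link
    d_uv = math.dist(u, v)
    c = beta / (1.0 - beta * noise * d_uv ** alpha / p_u)
    # Interference of w at v, relative to the signal received from u.
    relative = (p_w / p_u) * (d_uv / math.dist(w, v)) ** alpha
    return min(1.0 + eps, c * relative)

def total_affectance(interferers, link, p_u, **params):
    """Sum of affectances; interferers is a list of (point, power) pairs."""
    return sum(affectance(w, link, p_w, p_u, **params) for w, p_w in interferers)

def link_succeeds(interferers, link, p_u, **params):
    """The link succeeds exactly when the total affectance is at most 1."""
    return total_affectance(interferers, link, p_u, **params) <= 1.0
```

Because each term is capped at 1 + ε, a single strong interferer already pushes the sum above 1, so the cap does not change which links succeed.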

6. Initial Tree Construction 

The general template for the algorithm is as follows. At any given time, a subset of the nodes 
is active, with initially all nodes active and in the end only one node. Links are formed between 
pairs of active nodes, by a node u broadcasting, and another node v acknowledging that message 
in the next slot. When such a communication succeeds, links (u, v) and (v, u) become part of 
the network and node u becomes inactive (and forms no further links). The still active node v is 
its parent in the eventual aggregation tree. The link (u, v) is then part of the aggregation tree and 
the link (v, u) is part of the dissemination tree. 

(Footnote: The high-probability guarantee can be amplified to hold for any constant c, by scaling up the constant factors.) 

In what follows, λ_1, λ_2, ..., γ_1, γ_2, ... are constants. 

The algorithm proceeds in ⌈log A⌉ rounds, each containing λ_1 log n slot-pairs (a slot-pair is simply 
two consecutive slots). Each node u maintains a link set L_u storing incoming and outgoing links 
along with a time stamp. The final set T is then simply ∪_u L_u. In this initial tree construction, 
slots in the schedule of the links correspond simply to the time stamps. 

At the beginning of each slot-pair in round r, each active node decides to be a broadcaster with 
i.i.d. probability p (p ≤ 1/2, to be determined), and a listener otherwise. Then, 

• During the first slot, a broadcaster u transmits a message and a listener v listens for 
messages. 

• During the second slot, a listener v that received a message from u such that 2^{r−1} ≤ 
d(u, v) < 2^r in the previous slot does the following with i.i.d. probability p: add the links 
(u, v) and (v, u) to L_v with appropriate slot numbers and return an acknowledgment. A 
broadcaster u listens for acknowledgments during this slot, and on receiving one (say, from 
v) adds (u, v) and (v, u) to L_u, and becomes inactive. 

Note that a node only forms links with nodes at distance in the range [2^{r−1}, 2^r) during round 
r. Since each node knows this range, it can easily choose a power that ensures c(u, v) ≤ 2β for all 
d(u, v) ∈ [2^{r−1}, 2^r). Setting the power to 2βN 2^{rα} suffices. We say that a link (u, v) is successfully 
formed between nodes u and v during a slot-pair if all of the following happen: a) the transmission 
(u, v) is successful in the first slot, b) it is successfully acknowledged in the second slot (i.e., the link 
(v, u) successfully transmits), and c) both nodes store (u, v) and (v, u) in their set of links with the 
appropriate time stamps. Note that when this happens, u becomes inactive, by the description of 
the algorithm. The sole link that is outgoing from a given node is also the last one to be scheduled, 
thus the ordering satisfies the leaf-to-root order of aggregation trees. 
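
The slot-pair protocol can be simulated centrally to see how the tree takes shape. The sketch below is a simplified illustration, not the distributed algorithm itself: the function name `init_tree`, the default values of `p` and `lam` (standing in for p and λ_1), the seeded random generator, and the use of ⌊log2 A⌋ + 1 rounds (so that the longest distance falls in some length class) are all assumptions of this example. Time stamps and the link sets L_u are omitted; only parent pointers are recorded. It expects at least two nodes with minimum distance 1 and β ≥ 1.

```python
import math
import random

def init_tree(points, p=0.2, lam=40, alpha=3.0, beta=1.0, noise=1.0, seed=0):
    """Simulate the initial tree construction; returns (parent, active)."""
    rng = random.Random(seed)
    n = len(points)

    def dist(a, b):
        return math.dist(points[a], points[b])

    longest = max(dist(a, b) for a in range(n) for b in range(a + 1, n))
    rounds = math.floor(math.log2(longest)) + 1   # smallest r with longest < 2**r
    slot_pairs = max(1, math.ceil(lam * math.log2(n)))
    active, parent = set(range(n)), {}

    def received(v, u, senders, power):
        # SINR test at listener v for the transmission of u (uniform power).
        signal = power / dist(u, v) ** alpha
        interference = sum(power / dist(w, v) ** alpha for w in senders if w != u)
        return signal >= beta * (noise + interference)

    for r in range(1, rounds + 1):
        power = 2 * beta * noise * 2 ** (r * alpha)
        for _ in range(slot_pairs):
            if len(active) == 1:
                return parent, active
            broadcasters = {u for u in active if rng.random() < p}
            listeners = active - broadcasters
            # Slot 1: with beta >= 1 a listener decodes at most one broadcaster.
            acks = {}
            for v in listeners:
                for u in broadcasters:
                    if received(v, u, broadcasters, power):
                        in_class = 2 ** (r - 1) <= dist(u, v) < 2 ** r
                        if in_class and rng.random() < p:
                            acks[v] = u
                        break
            # Slot 2: acknowledgers transmit; each broadcaster listens.
            for v, u in acks.items():
                if received(u, v, acks.keys(), power):
                    parent[u] = v          # link (u, v); its dual is (v, u)
                    active.discard(u)
    return parent, active
```

Every run leaves each node either active or with exactly one parent, and the parent pointers never form a cycle, since a node stops listening once it has a parent. Whether a single active node remains depends on the random choices and on the constants, as in the analysis that follows.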

Remarks. Two technical clarifications. First, note that a listener v can store a failed link, since 
it does not necessarily know whether an acknowledgment (v, u) succeeded. However, this is not a 
problem, since: a) Node u remains active if the acknowledgment fails and connects itself later to 
some node (or eventually becomes the root), b) Transmission of the link (v,u) is not problematic 
for other links, since links transmitting in that slot did succeed in the presence of that transmission. 
In any case, it is easy to efficiently "clean up" such stray links after the whole network is formed. 
Second, as constructed, the dissemination tree has the opposite order of links in the schedule (links 
closer to the root are scheduled later, instead of earlier, as the definition calls for). This is also 
easily fixable after the network is formed by a reversal process initiated by the root. We omit these 
details in this version. 

6.1. Analysis. We first show that short links have a high probability of succeeding. 

Lemma 5. Assume that at the beginning of round r, the minimum distance between active nodes 
is at least 2^{r−1}. Consider any slot-pair in the round and active nodes u and v with d(u, v) < 2^r. 
Then, with probability at least (1/4) p^2 (1 − p), the link (u, v) is successfully formed in that slot-pair. 
Similarly, with probability at least (1/4) p^2 (1 − p), the link (v, u) is successfully formed. 

Proof. Let ρ = 2^{r−1}. Let M_r be the set of currently active nodes and let ℓ = (u, v). Let B_r be the 
set of broadcasters during the slot-pair. First, note that by the description of the algorithm, 

$$ \mathbb{P}(u \in B_r \text{ and } v \notin B_r) = p(1 - p). $$

For t = 0, 1, ..., define C_t to be the ball around v of radius ρ(t + 1), and define the annulus A_t as 
A_0 = C_0 and A_t = C_t \ C_{t−1} for t ≥ 1. From this it is easily computed that the area of A_t is 

$$ \mathrm{Area}(A_t) = \pi \rho^2 (2t + 1). \tag{2} $$

Now, by the definition of ρ, balls of radius ρ/2 around any pair of points in M_r do not intersect 
(since the minimum distance between active nodes is ρ). Combining this with Eqn. 2, we see that 
A_t contains at most 16(2t + 1) nodes in M_r, which is at most 48t for t ≥ 1. 

For x ∈ M_r ∩ A_0, a_x(ℓ) ≤ 1 + ε, simply by the definition of affectance. For x ∈ M_r ∩ A_t with t ≥ 1, 
d(x, v) ≥ ρ · t, while d(u, v) < 2ρ and all nodes use the same power in the round; thus 
a_x(ℓ) ≤ q (2/t)^α ≤ 2β (2/t)^α, where q = c(u, v) ≤ 2β. Note that for any x, P(x ∈ B_r) = p. 
Thus, 

$$ \begin{aligned}
\mathbb{E}\big(a_{B_r}(\ell)\big) &= \mathbb{E}\big(a_{B_r \cap A_0}(\ell)\big) + \sum_{t \ge 1} \mathbb{E}\big(a_{B_r \cap A_t}(\ell)\big) \\
&\le 16(1 + \epsilon)p + p \cdot 2\beta \sum_{t \ge 1} \left(\frac{2}{t}\right)^{\alpha} 48t \\
&\le 16(1 + \epsilon)p + 96\, p\, \beta\, 2^{\alpha}\, \frac{\alpha - 1}{\alpha - 2},
\end{aligned} $$

using the bound ζ(x) = Σ_{n ≥ 1} 1/n^x ≤ x/(x − 1) on the Riemann zeta function (applied with x = α − 1). 
Thus, for any p ≤ (64(1 + 6β 2^α (α − 1)/(α − 2)))^{−1}, we get that E(a_{B_r}(ℓ)) ≤ 1/2. By Markov's 
inequality, a_{B_r}(ℓ) ≤ 1 with probability at least 1/2 (recall that this means that the link ℓ succeeds). Thus, 

$$ \mathbb{P}\big(a_{B_r}(\ell) \le 1 \text{ and } u \in B_r \text{ and } v \notin B_r\big) \ge \tfrac{1}{2}\, p(1 - p). $$

A similar argument proves that the probability of ℓ' = (v, u) succeeding is at least (1/2) p, and thus the 
link (u, v) is formed with probability at least (1/4) p^2 (1 − p). The argument for the potential formation 
of link (v, u) is identical. □ 

Now we can claim that, 

Lemma 6. At the beginning of each round r, the distance between any two active nodes is at least 2^{r−1}, 
w.h.p. 

Proof. (Sketch.) The claim is clearly true for round 1 (since the minimum distance in the system 
is 1). Now inductively assume that it is true for round r. Consider any two nodes u, v that are 
active at the beginning of round r + 1 with d(u, v) < 2^r. Consider any slot-pair of round r in which they 
are both active. By Lemma 5, the probability of both of them remaining active after this slot-pair 
is at most 1 − (1/4) p^2 (1 − p) ≤ 1 − (1/8) p^2, using p ≤ 1/2. Thus, the probability of both of them remaining active over 
λ_1 log n slot-pairs is at most (1 − (1/8) p^2)^{λ_1 log n}. Setting λ_1 = Θ(1/p^2) large enough, this probability can be upper bounded by n^{−c}. 
This proves the Lemma (after union bounding). □ 
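
The number of slot-pairs needed per round can be worked out numerically from the per-slot-pair bound. The helper below is a sketch under the assumption that each slot-pair separates a close pair with probability at least p^2/8, as in the proof sketch; the function name and the target failure probability n^{−c} are choices of this example.

```python
import math

def slot_pairs_needed(n, p, c):
    """Smallest k with (1 - p*p/8) ** k <= n ** (-c)."""
    per_pair_log = -math.log1p(-p * p / 8.0)   # equals -ln(1 - p^2/8) > 0
    return math.ceil(c * math.log(n) / per_pair_log)
```

Since −ln(1 − x) ≥ x, the result is never more than ⌈8c ln(n) / p^2⌉, which matches a choice of λ_1 proportional to 1/p^2.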

We can now prove the first main result. 

Proof (of Thm. 2). By Lemma 6, it is clear that within O(log A) rounds, and thus O(log A · log n) 
slots, at most one active node remains (since the maximum distance among nodes is A). Since nodes 
only cease to be active by forming links with an active node, it is also clear that exactly one node 
remains active. When nodes cease to be active, they do so only by connecting in both directions 
to still-active nodes (by the description of the algorithm). By induction, the whole network is then 
strongly connected to the single node active at the end. This last active node is the root of both 
the aggregation and dissemination trees. □ 

We can also show that the network formed has low degree, where the degree of a node u is its 
number |L_u| of incident links. 

Theorem 7. The probability of a node having degree d is at most e^{−p^2 d/8}. As a result, the maximum 
degree is O(log n), w.h.p. 

Proof. Let u be a node and consider any round r and any slot-pair in the round where u is active. 
Suppose there is another active node v with d(u, v) < 2^r. Then by Lemma 5, u ceases to be active 
after this slot-pair with probability at least (1/4) p^2 (1 − p) ≥ (1/8) p^2. Note that in slot-pairs where no 
such v exists, u does not form a link. Thus, the degree of a node is upper bounded by the number 
of slot-pairs where such a v exists and u remains active afterwards. The probability of there being 
d such slot-pairs is at most (1 − p^2/8)^d ≤ e^{−p^2 d/8}. 

Setting d = (80/p^2) log n gives us the second part of the theorem. □ 



7. Sparsity and Power Control 

In this section, we show that the link set T produced by the algorithm of Sec. 6 can actually 
be scheduled in considerably fewer slots (in terms of dependence on A) using mean power, thus 
proving Thm. 3. This leads to an algorithm to reschedule the same links with this improved power 
assignment. The main idea is to show that the produced link set has certain geometric properties 
that allow such improved scheduling. 

Definition 8. A set L of links is ψ-sparse if, for every closed ball B in the plane, 

$$ |B \cap L(8 \cdot \mathrm{rad}(B))| \le \psi, $$

where rad(B) is the radius of B, L(d) is the set of links in L of length at least d, and B ∩ Q denotes 
the links in a set Q with at least one endpoint in ball B. 

It was shown in [11] that the sparsity property (not explicitly defined there) is connected to a 
property named amenability in [11], which via an algorithm in [14] and results in [10] imply the 
following: 

Theorem 9 ([11]). Let L be a ψ-sparse link set, for some ψ ≥ 1. Then any L' ⊆ L contains a 
subset of size Ω(|L'|/ψ) that is feasible under [...] can be scheduled in O(ψ · Υ · log n) slots using mean power. 

We provide a short overview of these ideas for reference in Appendix B. 
We now claim a sparsity result for the network T formed by the algorithm. 

Lemma 10. If D is a disc of radius ρ in the plane, then the number of links in T longer than 8ρ 
that have at least one endpoint in D is O(log n), w.h.p. 

Proof. Let L = T(8ρ) ∩ D be the links of T of length at least 8ρ with an endpoint in D. We first 
claim that at most one node inside D is incident to a link in 
L. For contradiction, assume that there are two such nodes u and v. Now, by the description of 
the algorithm, links of length 8ρ or higher can only be formed in rounds log ρ + 4 or higher. Thus, 
both u and v were active during round log ρ + 4. However, d(u, v) ≤ 2ρ and thus by Lemma 6, at 
the end of round log ρ + 2, at most one of them could remain active. This is a contradiction. The 
proof of the Lemma is now complete by Thm. 7. □ 

By union bounding over all ρ and all balls (by careful selection, there are only polynomially 
many of them that are relevant), this implies: 

Theorem 11. The set T of links produced by the algorithm is O(log n)-sparse. 
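
The quantity bounded by the sparsity definition is easy to compute for a single ball. The sketch below counts, for a given closed ball, the links of length at least 8 times its radius that have an endpoint inside it; the function name and the coordinate-pair representation of links are assumptions of this example. Evaluating it over a family of balls yields a lower bound on the sparsity ψ of a link set.

```python
import math

def long_links_touching_ball(links, center, radius, factor=8.0):
    """Count links of length >= factor * radius with an endpoint in the ball.

    links: list of (u, v) pairs of (x, y) points; the ball is closed.
    """
    count = 0
    for u, v in links:
        long_enough = math.dist(u, v) >= factor * radius
        touches = math.dist(u, center) <= radius or math.dist(v, center) <= radius
        if long_enough and touches:
            count += 1
    return count
```

For a star of five links of length 10 around the origin, a unit ball at the origin touches all five long links, so such a star is at best 5-sparse; this is why bounded degree matters for sparsity.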



We now propose the following extension of the algorithm to schedule the links using significantly 
fewer slots. 

The sender of each link ℓ = (u, v) in T sets its power to mean power, d(u, v)^{α/2}. The links then 
use the distributed algorithm from [15] to compute a schedule of the links using this 
power assignment. 

We can now prove Thm. 3. 

Proof. Thm. 9 and Thm. 11 imply that T can be scheduled in O(Υ · log^2 n) slots using mean 
power. The distributed scheduling algorithm of [15] produces an O(log n)-approximation [9], giving 
the Theorem. (See Appendix O for a technical note on the approximation factor in [9].) □ 

The resulting schedule, however, does not necessarily satisfy the ordering property of bi-trees. 

8. Matching Centralized Bounds 

In this section, we prove Thm. 4. The differences from Sec. 7 are threefold. First, we achieve 
more efficient final schedules. Second, unlike Sec. 7, we produce bi-trees. The third is a difference 
in approach. While the algorithm in Sec. 7 merely rescheduled the links in the original tree, in this 
section, we shall actually build a new tree with superior properties, but will do so by using the 
original tree. 

We use Init to refer to the algorithm from Sec. 6 that constructs the initial bi-tree. For any link 
set L which is a subset of a directed rooted tree, we call a node u a "top level node" with respect 
to L if no link of the form (u, w) is in L (i.e., the link between u and its parent in the rooted tree, if 
such a link exists, is not present in L). 

In what follows, we focus on forming the aggregation tree part for simplicity (constructing the 
dissemination tree portion of the bi-tree is essentially identical). The algorithmic framework is as 
follows. 



Algorithm 1 TreeViaCapacity 

Set i = 0 and P_0 = P (the original input set). 
for i = 0, 1, 2, ... until |P_i| = 1 do 
    Construct (aggregation) tree T on P_i using Init. 
    Find a feasible subset T' ⊆ T. 
    Let P_{i+1} be the set of top level nodes w.r.t. T'. 
end for 
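
The control flow of this framework can be sketched in a few lines. In the code below, `build_tree` and `feasible_subset` are placeholders for Init and for the capacity step; they, the toy stand-ins `chain_tree` and `every_other_link`, and all other names are assumptions of this example and carry no SINR semantics. The sketch expects a non-empty node list.

```python
def top_level_nodes(nodes, links):
    """Nodes with no outgoing link (u, w) among the given links."""
    senders = {u for u, _ in links}
    return [x for x in nodes if x not in senders]

def tree_via_capacity(nodes, build_tree, feasible_subset):
    """Return (schedule, root); schedule[i] holds the links of slot i."""
    schedule = []
    current = list(nodes)
    while len(current) > 1:
        tree = build_tree(current)        # stand-in for Init on the current set
        kept = feasible_subset(tree)      # stand-in for finding a feasible T'
        if not kept:
            raise RuntimeError("feasible_subset returned no links; no progress")
        schedule.append(kept)
        current = top_level_nodes(current, kept)
    return schedule, current[0]

def chain_tree(nodes):
    """Toy tree: each node points to the next one; the last node is the root."""
    return [(nodes[i], nodes[i + 1]) for i in range(len(nodes) - 1)]

def every_other_link(tree):
    """Toy selection keeping about half of the links."""
    return tree[::2]
```

With eight nodes and these toy stand-ins the loop finishes after three iterations, each removing about half of the remaining nodes, which mirrors the geometric decrease used in the analysis below.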



If T' is large, then this process ends quickly. 

Theorem 12. Assume that in each iteration, E(|T'|) ≥ δ|T| for some δ > 0. Then, the process 
ends after O((1/δ) log n) iterations and the links produced form an aggregation tree connecting the nodes 
in O((1/δ) log n) slots, w.h.p. 

Proof. First we show that: 

Claim 8.1. $E(|P_{i+1}|) \le (1 - \frac{\delta}{2})|P_i|$, for any $P_i$ such that $|P_i| \ge 2$.

Proof. Recall that $E(|T'|) \ge \delta|T| = \delta(|P_i| - 1)$. Consider any link $(u, v) \in T'$. Clearly,
this link rules out $u$ as a top level node. Also, since $T$ is an aggregation tree, there can be at most
one outgoing link from each node $u$. Thus, $E(|P_{i+1}|) \le |P_i| - E(|T'|) \le |P_i| - \delta(|P_i| - 1) \le (1 - \frac{1}{2}\delta)|P_i|$
(for $|P_i| \ge 2$). □

This can be used to show that the process ends in $O(\frac{1}{\delta} \log n)$ steps, w.h.p.
Claim 8.2. P(|P t | > 1) < £ for t = 10^ logn. 

Proof. Since $|P_i|$ is non-increasing in $i$, for contradiction, condition on all $|P_i| \ge 2$ for $i < t$. Then we
can apply Claim 8.1 repeatedly to show that

$E(|P_t|) \le (1 - \frac{\delta}{2})^t |P_0| = (1 - \frac{\delta}{2})^t n$.

By the definition of top level nodes, nodes not in $P_{i+1}$ are connected to some node in $P_i$ by a
link. Thus, the final structure is clearly a converge-cast tree. The ordering on schedules is also
guaranteed by the way the algorithm proceeds (it is easy to see that nodes can be involved in at 
most one link in a feasible set, thus the ordering is not violated within T'). 

Finally, since each iteration uses a single slot, the bound on iterations immediately implies the 
bound on the number of slots in the schedule. The theorem follows. □ 

To implement the above scheme, we need to show that $T'$ can always be found for a large enough
$\delta$ to claim the results in Thm. [U

We do this in two steps: in the first step an $O(1)$-sparse subset $T(M) \subseteq T$ is chosen, and in
the second step a subset of $T(M)$ is chosen as $T'$. The first step is identical for the mean power and
arbitrary power cases. The set $T(M)$ is defined in the following result, whose proof is in Appendix A.



Theorem 13. Let $M$ be the set of nodes of degree at most $\rho$ in $T$, and let $T(M)$ be the links
in $T$ induced by $M$. Then, $T(M)$ is $O(1)$-sparse and $E(|T(M)|) = \Omega(|T|)$.

To actually compute T(M) in a distributed fashion, note that nodes can easily decide if they are 
in M (by counting the number of links adjacent to them). One sweep through the existing network 
T is enough for each node to detect which of their links (if any) are in T(M). 
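
The local computation described above can be sketched as follows. This is a centralized illustration only: the function name `low_degree_links` and the threshold parameter `rho` are assumptions made for the example, and in the distributed setting each node would count its own adjacent links instead.

```python
from collections import Counter

def low_degree_links(tree_links, rho):
    """Return (M, T_M): nodes of degree at most rho, and tree links with both endpoints in M."""
    degree = Counter()
    for u, v in tree_links:
        degree[u] += 1
        degree[v] += 1
    M = {x for x, d in degree.items() if d <= rho}
    T_M = [(u, v) for (u, v) in tree_links if u in M and v in M]
    return M, T_M
```

For example, with links `[(1, 0), (2, 0), (3, 0), (4, 3), (5, 4)]` and `rho = 2`, node 0 has degree 3 and is excluded, so only the links `(4, 3)` and `(5, 4)` survive.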

Selecting T' is also reasonably easy for mean power, but more involved for arbitrary powers. The 
following two subsections deal with these cases separately. Note that we keep the original network 
around at all times, which is useful for controlling the construction of the new one. Running these 
networks in parallel can be achieved with simple time-division multiplexing. 

8.1. Finding $T'$ with mean power. Assume that $T(M)$ is known. It can be shown that the
average affectance in the linkset $T(M)$ (under mean power) is small, namely $O(\Upsilon)$ (proof in Appendix A).

Lemma 14. Affectance within $T(M)$ under mean power satisfies $a_{T(M)}(T(M)) \le \gamma_1 \Upsilon |T(M)|$, for
some constant $\gamma_1$.

Lemma 14 implies, after some basic manipulation, that there exists $Q$ with $|Q| \ge \frac{1}{2}|T(M)|$, such
that $a_{T(M)}(\ell) \le 2\gamma_1 \Upsilon$ for all $\ell \in Q$.

The following sampling mechanism produces a large feasible set in expectation (see [3]): Each
link in $T(M)$ transmits with iid probability $\frac{1}{4\gamma_1 \Upsilon}$, with the successful links forming the set $T'$.

Since each transmitting link in $Q$ succeeds with probability at least $\frac{1}{2}$, the expected size of $T'$ is at least
$\frac{1}{2} \cdot \frac{1}{4\gamma_1 \Upsilon}|Q| = \Omega(\frac{1}{\Upsilon}|T(M)|)$. Combining this with Thm. 13 we get that

Lemma 15. $E(|T'|) = \Omega(\frac{1}{\Upsilon} E(|T(M)|)) = \Omega(\frac{1}{\Upsilon}|T|)$.
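
One round of the sampling mechanism can be simulated as below. This is a minimal sketch under assumptions that are not taken from the paper: links are pairs of points in the plane, success means SINR at least `beta`, and the names `mean_power_sample`, `prob`, `alpha`, `beta` and `noise` are chosen for the example. The transmission probability is passed in directly rather than derived from $\gamma_1$ and $\Upsilon$.

```python
import math
import random

def mean_power_sample(links, prob, alpha=3.0, beta=1.0, noise=0.0, seed=0):
    """Each link transmits independently with probability prob using mean power;
    return the links whose receivers decode successfully (SINR >= beta)."""
    rng = random.Random(seed)
    tx = [l for l in links if rng.random() < prob]
    # mean power: sender of a link of length d uses power d^(alpha/2)
    power = {l: math.dist(*l) ** (alpha / 2) for l in tx}
    ok = []
    for (s, r) in tx:
        signal = power[(s, r)] / math.dist(s, r) ** alpha
        interference = noise + sum(
            power[w] / math.dist(w[0], r) ** alpha for w in tx if w != (s, r))
        if signal >= beta * interference:
            ok.append((s, r))
    return ok
```

With `prob=1.0` and `beta=2.0`, two unit links placed 100 apart both succeed, while two parallel unit links at distance 0.5 from each other both fail.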

Thus, Thm. 12 can be invoked with $\delta = \Omega(\frac{1}{\Upsilon})$, to obtain the second half of Thm.

from which the claim follows by Markov's inequality. □



Theorem 16. There exists a distributed algorithm that forms and schedules a bi-tree in $O(\Upsilon \cdot \log n)$
slots using mean power. This algorithm completes in time $O(\Upsilon \log \Delta \cdot \log^2 n)$.

Proof. The performance of the final solution follows from Thm. 12, as mentioned above. Let us
bound the total running time. The algorithm Init needs to be invoked $O(\Upsilon \cdot \log n)$ times, for
a total cost of $O(\Upsilon \cdot \log \Delta \cdot \log^2 n)$. After forming $T$ with each such invocation, identifying $T(M)$
costs $O(\log \Delta \log n)$ (the cost of $T$). Computing $T'$ is cheap since the sampling is done in parallel.
One technical aspect to note is that while the nodes choose $T'$, they need to know if their
transmission succeeded; this can be done without substantial loss of performance using an extra
acknowledgment slot, as we have seen before. The runtime bound of the theorem then follows. □

This theorem completes the proof of the second half of Thm. [U 

8.2. Finding $T'$ with arbitrary power. In this case, we want to find a large set $T'$, given $T(M)$,
and then choose a power assignment making the set feasible. 

We start with the link selection step. Leveraging the fact that our input instance T(M) is sparse, 
we implement a distributed version of a centralized algorithm for choosing such a set proposed in 

M- 

The following algorithm was shown in [13] to give a constant factor approximation for finding the
largest feasible subset of any given linkset: Given a linkset $R$, let the selected set be $L$, initially
empty. Go through all links in ascending order of length (breaking ties arbitrarily). If the condition

(3) $a_L(\ell) + a_\ell(L) \le \tau$,

holds, for a constant $\tau$, then the link $\ell$ is added to $L$ (Eqn. 1 of [2] can be seen to be essentially
equivalent to the above equation).
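
The greedy rule above can be sketched as follows. The affectance used here is a deliberately simplified assumption (linear power, no ambient noise, capped at 1, with `beta` as the only constant), and the names `affectance` and `greedy_select` are hypothetical; the exact weights used in [13] are not reproduced.

```python
import math

def length(link):
    s, r = link
    return math.dist(s, r)

def affectance(w, v, alpha=3.0, beta=1.0):
    """Assumed affectance of link w on link v under linear power, noise ignored."""
    d = math.dist(w[0], v[1])          # sender of w to receiver of v
    if d == 0:
        return 1.0
    return min(1.0, beta * (length(w) / d) ** alpha)

def greedy_select(links, tau=0.5, alpha=3.0, beta=1.0):
    """Scan links by ascending length; keep a link if a_L(l) + a_l(L) <= tau."""
    selected = []
    for l in sorted(links, key=length):
        incoming = sum(affectance(w, l, alpha, beta) for w in selected)  # a_L(l)
        outgoing = sum(affectance(l, w, alpha, beta) for w in selected)  # a_l(L)
        if incoming + outgoing <= tau:
            selected.append(l)
    return selected
```

On three unit links, two of which run in parallel at distance 0.5, the sketch keeps the first of the close pair and the far link, and rejects the second close link.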

For simplicity, we assume in this abstract that receivers can measure the SINR of a successful
link (i.e., can measure if the link succeeded with a desired threshold $\tau$ or not). This assumption
can be removed. 

Assume the formation of T using Init required R rounds. Our selection algorithm Distr-Cap 
has the following outline. 

Distr-Cap contains $R$ phases. In phase $i$, links in $T(M)$ that were formed in round
$i$ of Init decide whether or not to add themselves to the selected set $T'$.

By the description of Init, links formed in the same round belong to the same length class (also, 
links formed in a particular round are smaller than all links formed in later rounds). 

For all $i$, phase $i$ of Distr-Cap consists of one slot-pair. Let $Q$ be the links participating in this
phase (i.e., links formed during round $i$ of Init). During the first slot of the phase, the following
happens:

(1) All links $\ell$ in $T'$ (the set selected so far) transmit using linear power (i.e., $P_\ell = \ell^\alpha$).

(2) Links in $Q$ transmit with iid probability $p$ (a small constant) using linear power.

(3) Receivers in $Q$ record a success if they received a message across the link with SINR $\le \tau/4$.
Let $\hat{Q}$ be the set of links that recorded success.

During the second slot:

(1) Links in $T'_d$ (dual of $T'$) transmit using linear power (i.e., the receivers of $T'$ transmit using
linear power).

(2) Links in $\hat{Q}_d$ (dual of $\hat{Q}$, the set of links that recorded success in the first slot) transmit
with iid probability $\gamma_2 \cdot p$ for some $\gamma_2 < 1$, using linear power.

(3) Receivers in $\hat{Q}_d$ record a success if they received a message across the link with SINR $\le \frac{\gamma_2 \tau}{4}$.

Thus, at the end of the second slot, a success is recorded at the sender of an (original) link in $Q$ if the
transmission succeeded in both directions (the original link and the dual) with the required SINR
threshold. Let $Q^*$ be the set of links that succeeded. The updated solution is then $T' \leftarrow T' \cup Q^*$,
which simply means that links add themselves to $T'$ if they succeeded in both directions.
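
The slot-pair structure of Distr-Cap can be simulated centrally as in the sketch below. Several things here are assumptions made for illustration and are not taken from the paper: success is modeled as total affectance staying below a threshold, the affectance is the simplified linear-power, noise-free form, the second-slot threshold is taken to be `gamma2 * tau / 4`, and all function and parameter names are hypothetical.

```python
import math
import random

def affectance(w, v, alpha=3.0):
    """Assumed simplification: linear power, no noise, unit constants, capped at 1."""
    d = math.dist(w[0], v[1])            # sender of w to receiver of v
    return 1.0 if d == 0 else min(1.0, (math.dist(*w) / d) ** alpha)

def total_affectance(transmitters, v, alpha):
    return sum(affectance(w, v, alpha) for w in transmitters if w != v)

def dual(link):
    s, r = link
    return (r, s)

def distr_cap(rounds, tau=1.0, p=0.25, gamma2=0.5, alpha=3.0, seed=0):
    """rounds[i] holds the links formed in round i of Init (shorter classes first)."""
    rng = random.Random(seed)
    selected = []
    for Q in rounds:
        # Slot 1: links selected so far, plus a random sample of Q, transmit.
        tx = [l for l in Q if rng.random() < p]
        ok = [l for l in tx
              if total_affectance(selected + tx, l, alpha) <= tau / 4]
        # Slot 2: duals of selected links, plus a sample of the duals of ok, transmit.
        tx_d = [dual(l) for l in ok if rng.random() < gamma2 * p]
        sel_d = [dual(l) for l in selected]
        ok_d = [d for d in tx_d
                if total_affectance(sel_d + tx_d, d, alpha) <= gamma2 * tau / 4]
        # A link joins the solution only if it succeeded in both directions.
        selected += [dual(d) for d in ok_d]
    return selected
```

Setting `p=1.0` and `gamma2=1.0` makes every link transmit in both slots, which is convenient for checking the thresholds on small hand-built instances.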

We now analyze this algorithm. The following sub-subsections show that the selected solution is 
feasible and large (a constant factor approximation to the largest feasible subset), respectively. 

8.2.1. $T'$ is feasible. We now show that $T'$ satisfies Eqn. 3. It suffices to show that for all $\ell \in T'$,
if $L \subseteq T'$ are the links no larger than $\ell$ then:

$a_L(\ell) + a_\ell(L) \le \tau$.

The following two Lemmas imply the above. 

Lemma 17. $a_L(\ell) \le \frac{\tau}{4}$.

Proof. To see this, note the selection of Q in the first slot of each slot-pair. We claim that during 
this slot, all links in L are transmitting with linear power. For links in L that were selected in 
an earlier phase, this is obviously true. For links in Q that will be selected in L, this is true as 
well, since eventual admission in L is only possible (though not guaranteed) if the link decided to 
transmit during the first slot. 

The proof of the Lemma is completed by noting the SINR threshold used in the selection of 
Q. □ 

Lemma 18. $a_\ell(L) \le \frac{\tau}{4}$.

Proof. The selection process implemented during the second slot guarantees that $a_{L_d}(\ell_d) \le \frac{\gamma_2 \tau}{4}$,
where $L_d$ is the dual set of $L$ and $\ell_d$ is the dual of $\ell$ (this follows the proof of the previous Lemma
almost verbatim).

To complete the proof, we use a result from [15, Obs. 4]. It was shown that for a constant $\gamma_2$,
and links $\ell$ and $\ell'$,

Claim 8.3. $\gamma_2 \, a_{\ell'_d}(\ell_d) \le a_\ell(\ell') \le \frac{1}{\gamma_2} a_{\ell'_d}(\ell_d)$.

Using this claim, we get that

$a_\ell(L) \le \frac{1}{\gamma_2} a_{L_d}(\ell_d) \le \frac{\tau}{4}$

as required. □

8.2.2. $T'$ is large. Define, following [11, 14],

$f_\ell(\ell') = a_{\ell'}(\ell) + a_\ell(\ell')$ if $\ell \le \ell'$, and $f_\ell(\ell') = 0$ otherwise.



This definition is essentially equivalent to the definition of $f_\ell(\ell')$ of [11] and of $w(\ell, \ell')$ of
[14] (also see Appendix B). Those definitions are presented in terms of distances. The reason why
we choose to define $f_\ell(\ell')$ in terms of affectances here, instead of distances, is that affectances (or
their SINR equivalents) can be measured by the link receivers and thus used as a selection criterion.
For a set $X$, define $f_\ell(X) = \sum_{\ell' \in X} f_\ell(\ell')$ and $f_X(\ell') = \sum_{\ell \in X} f_\ell(\ell')$.

Recall that the input set $T$ is $O(1)$-sparse, which is of crucial importance. Consider once again
the execution of the algorithm for phase $i$. Let $T'_{i-1}$ be the selected set at the end of phase $i - 1$. As
before, let $Q$ be the links considered in phase $i$ and $Q^*$ be the links that succeeded in that phase.
Since $T$ is $O(1)$-sparse, so is $Q$.

Lemma 19. Let $Q'$ be the subset of links $\ell$ in $Q$ with $f_{T'_{i-1}}(\ell) \le \gamma_2^2 \cdot \tau/8$. Then, $E(|Q^*|) = \Omega(|Q'|)$.

Proof. Consider any link $\ell \in Q'$. We shall show below that $P(\ell \in Q^*) = \Omega(1)$, which implies the
Lemma.

In the first slot, $\ell$ transmits with probability $p$. We claim that:

Claim 8.4. $P(a_{\hat{T}}(\ell) \le \tau/8) \ge \frac{1}{2}$, where $\hat{T}$ are the links in $Q$ transmitting.

Proof. Let $\rho$ be such that the length class in phase $i$ covers lengths in $[\rho, 2\rho)$. Since $Q$ is $O(1)$-sparse,
it follows that balls of radius $\rho$ contain only a constant number of nodes that have links in $Q$.
The claim now follows from arguments essentially identical to those in Lemma El after setting the
probability $p$ sufficiently small. □

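Since $\ell \in Q'$, we see that $a_{T'_{i-1}}(\ell) \le \tau/8$, by the definition of $Q'$. Write $\hat{T}$ for the links in $Q$
transmitting in the first slot and $\hat{Q}$ for the links that record success in the first slot. Thus, if
$a_{\hat{T}}(\ell) \le \tau/8$, then $a_{T'_{i-1} \cup \hat{T}}(\ell) \le \tau/4$ and the transmission is recorded as a success. Thus, $\ell$ transmits
and is recorded as a success with probability at least $\frac{1}{2}p$. In other words,

(4) $P(\ell \in \hat{Q}) \ge \frac{1}{2}p$.

Now, condition on $\ell$ being in $\hat{Q}$. Then $\ell_d$ transmits with probability $\gamma_2 p$. The following claim
can be proven using Claim 8.3 and is similar to Claim 8.4.

Claim 8.5. $P(a_{\hat{T}_d}(\ell_d) \le \frac{\gamma_2 \tau}{8}) \ge \frac{1}{2}$, where $\hat{T}_d \subseteq \hat{Q}_d$ are the (dual) links transmitting in this slot.

Following an argument similar to the one used for the first slot, we see that in the second slot,
such a transmission is recorded as a success as well.

Thus, $P(\ell \in Q^* \mid \ell \in \hat{Q}) \ge \frac{1}{2}\gamma_2 p$. Combining this with Eqn. 4 we get $P(\ell \in Q^*) \ge \frac{1}{4}\gamma_2 p^2 = \Omega(1)$,
completing the proof of the Lemma. □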

This leads to the desired bound on the size of $T'$.

Theorem 20. The set $T'$ chosen by the algorithm satisfies $E(|T'|) = \Omega(|T(M)|)$.

Proof. By Thm. 9 there exists a set $O \subseteq T$ such that $O$ is feasible and $|O| = \Omega(|T|)$. Thus, it
suffices to show that $E(|T'|) = \Omega(|O|)$ for any feasible set $O$.

Thm. 1 of [14] shows that for a feasible link set $R$ and any link $\ell$,

(5) $f_\ell(R) = O(1)$.

Consider the set $R = O \setminus T'$. We divide $R$ further into two subsets: $R_1 = \{\ell' \in R : f_{T'}(\ell') > \gamma_2^2 \tau/8\}$
and $R_2 = R \setminus R_1$. Summing Eqn. 5 for all $\ell \in T'$,

(6) $f_{T'}(R) = O(|T'|)$.

By definition of $R_1$, $f_{T'}(R_1) > \frac{1}{8}\gamma_2^2 \tau |R_1|$. Assume first that $|R_1| \ge |R|/2$. Then we get $f_{T'}(R_1) >
\frac{1}{16}\gamma_2^2 \tau |R|$, which combined with Eqn. 6 gives $|T'| = \Omega(f_{T'}(R)) = \Omega(f_{T'}(R_1)) = \Omega(|R|)$. Since $|O| \le
|T'| + |R|$, this clearly implies that $|T'| = \Omega(|O|)$. Otherwise, $|R_1| < |R|/2$ and thus $|R_2| \ge
|R|/2$. But Lemma 19 implies that $\Omega(|R_2|)$ links were chosen by the algorithm (in expectation),
from which $E(|T'|) = \Omega(|O|)$ follows. □

8.2.3. Computing the power assignment. So far we have dealt with the selection of a large set of
feasible links. Once the link set $T'$ is identified, we must select the power assignments for this set.
Given a set of links that are known to be feasible, there exists a large body of work proposing
algorithms that converge to a power assignment making the set feasible. For example, two
recent ones are [17] and [2]. Using such an algorithm as a black box, we can find the appropriate
power assignment.

Theorem 21. There exists a distributed algorithm that connects the nodes in $O(\log n)$ slots. Assuming
that there exists an algorithm to find the power assignment for a feasible set in time $\eta$, this
algorithm completes in time $O(\log n(\log \Delta \cdot \log n + \eta))$.

As an example, if we select the algorithm from [17], $\eta$ can be bounded by $O(\log \Delta(\log n +
\log\log \Delta))$. This proves the first part of Thm. UJ
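
To make the black-box power-assignment step of Sec. 8.2.3 concrete, the sketch below uses the classical Foschini-Miljanic iteration, in which every sender rescales its power by the ratio of the target SINR to the SINR it currently observes. This is a swapped-in textbook scheme chosen for illustration; it is not claimed to be the algorithm of [17] or [2], and the names and parameters (`beta`, `alpha`, `noise`, `iters`) are assumptions.

```python
import math

def sinr(links, power, v, alpha=3.0, noise=1e-6):
    """SINR at the receiver of link v under the given power vector."""
    s_v, r_v = links[v]
    signal = power[v] / math.dist(s_v, r_v) ** alpha
    interference = noise + sum(
        power[w] / math.dist(links[w][0], r_v) ** alpha
        for w in range(len(links)) if w != v)
    return signal / interference

def foschini_miljanic(links, beta=1.0, alpha=3.0, noise=1e-6, iters=200):
    """Iterate P_v <- beta * P_v / SINR_v; this settles when the set is feasible for beta."""
    power = [1.0] * len(links)
    for _ in range(iters):
        power = [beta * power[v] / sinr(links, power, v, alpha, noise)
                 for v in range(len(links))]
    return power
```

Each update uses only the SINR measured at the link's own receiver, which is why iterations of this kind are natural in a distributed setting. Senders and receivers of different links are assumed to be at distinct points.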

9. Conclusions 

Our distributed algorithms have efficiency and effectiveness that appear to be close to best 
possible. An interesting direction would be to treat dynamic situations, including asynchronous 
node wakeup, node and link failures, and mobility.

Appendix A. Missing Proofs

Proof of Thm. 13



Proof. Recall that $M$ is the set of nodes of degree at most $\rho$ in $T$. For sets $X$ and $Y$,
let $\ell(X, Y)$ be the set of links with senders in $X$ and receivers in $Y$. We claim that setting
$T(M) = \ell(M, M)$ fulfills the properties claimed in the theorem.

The $O(1)$-sparsity follows by noting that the nodes in $M$ have degree $O(1)$; the proof of Lemma [TPl
can be followed verbatim using the constant-degree bound instead of the $O(\log n)$-bound employed
there.

Thus, what remains to be proven is that $E(|\ell(M, M)|) = \Omega(n) = \Omega(|T|)$. Let $M' = P \setminus M$ (recall
that P is the set of all nodes). Since T is a tree, \T\ = n — 1. Then, since the number of unique 
links adjacent to M is at least |Mp, it is easily computed that \M'\ < ^ and thus |M|>n(l — |). 
We show in Lemma \TZ\ below that E(\£(M', P)\) < Note that since T is a connected tree, 
\£{M,P)\ > \M\ - 1. Thus, 

E(\£(M,M)\) > E(\£(M,P)\)-E(\£(M,M')\) 

> E(\M\) - 1 — E{\£{M\P)\) 

> n (i - -) - 5 = n(») , 



which implies the theorem. □ 
Lemma 22. E{\£(M',P)\) < 

-p 2 d 

Proof. Recall that by Thm. [71 ¥(deg(u) > d) < e s , where deg(u) is the degree of u. This 
implies that F(deg(u) € [d,2d)) < e~ ~ . Since p = we can verify using basic calculus that 
e P 2 p2'/8 > p22"2t+2^ £ Qr a rj ^ Using this bound, we get, 

oo 

E(\£(M',P)\) <nY^^{deg{u) G [ P 2\ P 2 t+1 ))p2 t+1 
t=o 

10-2* 



E —P P£ -,+4-1 \ — ^ ~P \ — ^ 
e s p2^ <n> e is <n> 

t=0 t=0 t=0 

oo 1 oo .. „ 

n s—^ 1 n n s—^ 1 An n 

To + n z^ e io-2* - ^ + f Tw2^^2 T -^To-^9 



□ 

Proof of Lemma 14

Proof. The proof of this Lemma follows ideas from [8] and [10]. We need to relate the idea of
sparsity to the idea of "independence" used in [8].

We say that a set of links is $q$-independent if any two of them, $\ell = (x, y)$ and $\ell' = (x', y')$, satisfy
the constraint $d(x, y') \cdot d(y, x') \ge q^2 d(x, y) \cdot d(x', y')$.

We claim,

Claim A.1. Let $C$ be a sufficiently large constant. Let $Q$ be a $C$-independent set, and for any link
$\ell$ in $T'$, let $Q_\ell$ be the links in $Q$ longer than $\ell$. Then, $a_\ell(Q_\ell) + a_{Q_\ell}(\ell) = O(\Upsilon)$.

Proof. Partition $Q_\ell$ into two sets: $Q_\ell^l$, with links of length at least $d(x, y) \cdot 2(2\beta n)^{2/\alpha}$, and $Q_\ell^s$, with
the remaining links. It follows from Lemma 4.4] that $a_{Q_\ell^l}(\ell) + a_\ell(Q_\ell^l) = O(\log\log \Delta)$. On the
other hand, $Q_\ell^s$ can be partitioned into $O(\log n)$ length classes. For such sets, it is known [8] that
$C$-independence, for some constant $C$, implies feasibility. Let $Z$ be such a set. By Lemma 7 of
[15], $a_Z(\ell) = O(1)$. Since $Z$ belongs to a single length class, it is also possible to show (following
arguments similar to [15]) that $a_\ell(Z) = O(1)$. Thus, $a_{Q_\ell^s}(\ell) + a_\ell(Q_\ell^s) = O(\log n)$, summing over
the $O(\log n)$ such $Z$'s. The claim follows. □

By Lemma 23 below, we know that $T'$ can be partitioned into a constant number of $C$-independent
sets. Let $Q_1, Q_2, \ldots, Q_t$ be a partition of $T'$ into $t$ different $C$-independent sets. For a link $\ell$, let
$Q_i^\ell = \{\ell' \in Q_i : \ell' \ge \ell\}$. Then,

$a_{T'}(T') \le \sum_{\ell = (x, y) \in T'} \sum_{i=1}^{t} \left( a_\ell(Q_i^\ell) + a_{Q_i^\ell}(\ell) \right) = t \, |T'| \, O(\Upsilon) = O(|T'| \, \Upsilon)$,

since $t = O(1)$. □

Lemma 23. $T'$ can be partitioned into a constant number of $C$-independent sets.

Proof. Consider any link $\ell$. We claim that there are $O(1)$ links $\ell'$ at least as long as $\ell$ such that $\ell$
and $\ell'$ are not $C$-independent. This claim proves the lemma by the following algorithm. Sort the
links in an ascending order of their length, breaking ties arbitrarily.

Now consider the graph on links where there is an edge between links if they are not
$C$-independent.

By the claim, all links have $O(1)$ edges to links after them in the ascending order. Such a graph
is $O(1)$-colorable, where each color represents an independent set in the graph theoretic sense, and thus
a $C$-independent set according to our definition.
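
The coloring argument can be made concrete with the sketch below, which builds the conflict relation from the $q$-independence constraint and colors links greedily from longest to shortest. When a link is colored, only longer links already carry colors, so a link with at most $k$ conflicts among longer links never needs more than $k + 1$ colors. Links are assumed to be pairs of points in the plane, and the function names are hypothetical.

```python
import math

def conflict(l1, l2, C):
    """True if the two links violate the C-independence constraint."""
    (x, y), (x2, y2) = l1, l2
    lhs = math.dist(x, y2) * math.dist(y, x2)
    rhs = C * C * math.dist(x, y) * math.dist(x2, y2)
    return lhs < rhs

def partition_into_independent_sets(links, C):
    order = sorted(links, key=lambda l: math.dist(*l))    # ascending length
    color = {}
    for i in reversed(range(len(order))):                  # longest link first
        taken = {color[j] for j in range(i + 1, len(order))
                 if conflict(order[i], order[j], C)}
        c = 0
        while c in taken:                                  # smallest free color
            c += 1
        color[i] = c
    classes = {}
    for i, c in color.items():
        classes.setdefault(c, []).append(order[i])
    return list(classes.values())
```

Each returned class is pairwise conflict-free, that is, a $C$-independent set in the sense defined earlier in this appendix.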

Now we prove the claim. Recall that $T'$ is $\gamma_3$-sparse for some constant $\gamma_3$. Consider the link
$\ell = (u, v)$ and a ball of radius $(2C)^2 \cdot d(u, v)$ around $u$. By a basic geometric argument, this ball can
be covered by $O(1)$ balls of radius $d(u, v)/8$. By the definition of sparsity, there can be at most $\gamma_3$
links of length $d(u, v)$ or higher that have one endpoint in each of the smaller balls. Thus, the larger
ball also contains only $O(1)$ such links. We now claim that all other links, i.e., $\ell' = (u', v')$ such
that $\min(d(u', u), d(v', u)) > (2C)^2 \cdot d(u, v)$, are such that $\ell$ and $\ell'$ are $C$-independent. First, assume
that $d(u', v) \ge \frac{1}{4} d(u', v')$. Then $d(u', v) \cdot d(u, v') \ge \frac{1}{4} d(u', v') \cdot (2C)^2 \cdot d(u, v) = C^2 d(u', v') d(u, v)$,
which implies $C$-independence. On the other hand, if $d(u', v) < \frac{1}{4} d(u', v')$, then $d(u, v') \ge d(u', v') -
d(u', v) - d(u, v) \ge d(u', v') - 2 d(u', v) \ge \frac{1}{2} d(u', v')$, from which $C$-independence follows by similar
computations. □

Appendix B. A Short Primer on Sparsity, 
Amenability and Feasibility 

In [11], a set of links $L$ was defined to be $\eta$-amenable if the following holds: for any link $\ell$ ($\ell$
not necessarily a member of $L$), $\sum_{\ell' \in L} f_\ell(\ell') \le \eta$, for a function $f$ (see Sec. 8.2.2), for some $\eta$.
Actually, in [11], $\eta$ is implicitly considered to be a constant, and just the term amenable is used.
The definition extends naturally to arbitrary $\eta$. It was shown in [14] that an $\eta$-amenable set $L$
has a feasible subset of size $\Omega(\frac{1}{\eta}|L|)$.

Now the final ingredient needed is to tie sparsity to feasibility (and thus get Thm. 9). We claim
that sparsity as defined in this paper implies amenability. This is implicit in [11]. Specifically, in
proving the main Lemma 4 of [11], it is first shown that the structure in question (which happens
to be a Minimum Spanning Tree on the set of nodes) is $O(1)$-sparse (Lemma 5) and then this is
used to show that the structure is amenable (which then implies a large feasible subset).

Appendix C. A Note on the Approximation Factor for Distributed Scheduling 

If acknowledgments have to be explicitly implemented, the algorithm of [15, 9] produces a schedule
of length $O((T + T') \cdot \log n)$, where $T$ is the optimal schedule for the input link set, and $T'$ is
the optimal schedule for the dual set of the input set, which may be larger than $O(T \log n)$. For
our instance, this problem simply is not relevant. The constructed link set $T$ is its own dual, and
thus an $O(\log n)$-approximation factor can be safely asserted.